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Abstract. In this paper, we look at the types of errors that are introduced when using the 
Euler characteristic integral approach to count the number of targets in a discrete sensor field. 
In contrast to other theoretical analyses, this paper examines discrete sensor fields rather than 
continuous ones. The probabilities of first and second order errors are worked out combinatorially, 
and a general formula is found which is discovered to be proportional to much higher order 
errors. Asymptotic results are also derived. Thus, this work gives us the bias of the Euler 
characteristic integral as a point estimator for the number of targets in a sensor field, and a 
general understanding of how the Euler characteristic integral fails, or more often succeeds. 



1. Introduction 

Imagine a planar sensor field in a region C that is infinitely dense, with infinitcsimally small 
sensors. Each point x £ f2 has a sensor which records simply the number of objects directly over 
it. Throw many objects of any shape onto this field, and as long as their 2-dimensional projections 
have identical Euler characteristics ^ 0, using the work of Robert Christ and Yuliy Baryshnikov 
([Hi 0): the number of objects (or targets) can be exactly determined using nothing but the sensor 
field data recorded, which is the location of each sensor and how many objects overhead it detects. 




Black = target 

Gray lines = boundaries 
of sensor supports 

Gray region = tangency 
between targets 



Figure 1. A sensor grid. The union of each sensor's 1x1 area makes up the 
entire field. When two targets are almost tangent with no overlap but no sensor in 
between, they become impossible to differentiate. The tangency above (highlighted 
in gray) causes the two targets to appear as one. 
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Now discretize the sensor field, take away its infinite sensor density, and errors will be introduced 
into the above solution of this target enumeration problem. The most obvious error occurs when 
two targets do not overlap, but there is no sensor in between them to differentiate them from one 
another, so they appear instead to be one large target (see figure [T]) - no algorithm could detect 
two targets here unless information on the targets' shapes is given. 

This paper is a numerical analysis of the target enumeration via Euler characteristic integral 
technique given by Christ and Baryshnikov. It proceeds by analyzing the causes of the most common 



first order (section 2.4 1 and second order errors (2.5 1 in target enumeration when the sensor field 



is discretized, then combinatorial arguments are given for the probabilities of these different errors 
under the assumption that the targets' 2-dimensional projections are disk shaped and targets fall 
onto the sensor field under a uniform distribution. With these probabilities calculated, the errors 



can be added back and the correct number of targets can be better approximated (2.6). We then 
derive an asymptotic result for how the Euler characteristic integral behaves for large numbers of 
targets ([s]). 

1.1. A survey of the field. Tomorrow's man will step into a world that is inundated with what 
we call today "sensors." Fields of sensors will count him as he steps onto the street, as he drives 
his hover car, and when he steps onto the battlefield. Tiny sensors will precisely read his DNA. 
Other sensors will tirelessly count the pesticide molecules on his vegetables. Sensors of all sorts will 
appear to read his mind, returning data to him just when he needs it. These same sensors will feed 
smart algorithms working to improve traffic flow, crop development, and battlefield awareness [7j. 
But before this world can exist for our everyman to experience, much work needs to be done. 

The study of sensor networks is nascent, and the fundamentals are still being developed. There 
are many different problems which need to be solved. There are research groups working on network- 
ing, communication, and signal processing algorithms to connect individual sensors into working, 
collaborative sensor fields using minimal computing resources and energy ( 0] , [S] , , [15' , amongst 
many others) . Another team is working on monitoring and making sense of data coming off real 
world sensor fields with broken and malfunctioning sensors [19] . And finally, complete mathematical 
theories for target enumeration and tracking via fields of sensors are just starting to be developed. 

The problem of interest here is target enumeration: given the return of a sensor field, how best 
to fuse the data from the individual sensors so as to count the total number of unique targets being 
picked up by the sensor field. 



Figure 2. A visualization of the return of a typical discretized sensor grid, as will 
be studied throughout this paper. This sensor grid is 200 x 500, and 100 targets 
of radius 6 have been placed over it. A sensor with as many or more targets over 
it then all other sensors is painted white, a sensor with no target over it is painted 
black, and all other sensors are painted on a sliding scale between white and black. 
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Much of the current work in target enumeration or the closely related field of target tracking, 
such as d] , [TT] , and [12] , assumes targets that in one way or another are each easily detected and 
differentiated from one other. However, other teams have worked on solving the complete target 
counting and tracking problem where targets are less easily differentiated from one other and from 
their environment. 

One team has analyzed target enumeration with binary sensors, which each return a result from 
{0, 1} [TS]. They found that for any fixed time, not enough information is contained in a binary 
sensor field to return an accurate estimate of the number of targets. But by looking at time series 
data through a particle filter, both target counts and target trajectories can be nicely approximated 
under the weak assumption of relatively small changes in target velocity vector moment to moment. 
Even using real world sensors, the results are impressive. However, the techniques have so far only 
been developed in one space dimension and no theoretical efficacy analysis has been done. 

Given a sensor field where each sensor picks up a voltage inversely proportional to the distance 
from each target, additive over each target, one team [8] has worked on counting the number 
of targets picked up by this field by counting the number of relative maxima in the returned 
voltages. However, their methods fail when targets are insufficiently separated or are in other 
complex arrangements. The authors try to reduce errors by intelligently turning on and off sensors 
and carefully tracking the effects of noise. They place approximate bounds on the expected errors 
in their methods, but again no close analysis of how errors accumulate under their algorithms has 
been performed. 

The most complete, but also the most theoretical, work to date on a solution to the problem 
of target enumeration has been done by Y. Baryshnikov and R. Christ using the topological tech- 
nique of the integral with respect to the Euler characteristic {[T], [2]). Here the theory has been 
substantially worked out and, at least in the case of infinitely dense continuous sensor fields, is able 
to accurately count targets which are in very complex configurations using sensors which return 
only local target counts. Further, the expected Euler characteristic integral of a Gaussian random 
field has been worked out, which allows for unbiased target enumeration from noisy sensor fields 
[21. However, until now nothing more than conjecture has existed concerning numerical analysis of 
the Euler characteristic integral when our sensor field is discretized. 

We begin here with a preliminary numerical analysis of the Euler characteristic integral as a tool 
for target enumeration, picking up where Baryshnikov and Christ left off. We make exacting and 
precise assumptions, and derive expectations for the number of miscounted targets in a discrete 
sensor field when the enumeration via Euler characteristic integral theory is applied. We gain an 
understanding into how different types of discretization errors occur, and the result is the only 
technique that we know of to unbiasedly, to an approximation, count the number of targets in a 
discrete sensor field returning only local target counts at each sensor. 



1.2. Background. Before we can present the main result we use in topological target enumeration, 
we need a few definitions. The sensors in this paper are thought of as 2-dimenional squares and 
as detecting all targets directly above them, and the targets can be thought of as being not point 
particles but 2-d laminae. A sensor field, discrete or continuous, has a corresponding height function 
h{x) G Z which at each point x in the sensor field 57 C M'^ returns the number of targets the sensor 
at X is detecting. Note that even when the sensor field is discrete, its height function can still be 
thought of as coming from a continuous sensor field. We give the following definition y^: 
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Excursion sets: For functions h : X ^ Z, the set {h — s} is the level set {x E X : h{x) = s}, 
and the set {h > s} is the upper excursion set {x Cz X : h{x) > s}. Lower excursion sets 
are hkewise defined. 

fn the continuous case, the formula for the number of targets (with Euler characteristic = 1) in a 
given sensor field is the duality formula Theorem 4-3 in Christ and Baryshnikov: 

Theorem 1.1 ([TJ Theorem 4.3]). For /i : — )• N constructihle and upper semicontinuous, 

where /3o denotes the zeroth Betti number, or equivalently, the number of connected components of 
the set. 

Constructihle functions are "tame" and integer-valued on a topological space (for a quick intro- 
duction, see [2]). 



While not thoroughly tested in their article [T], Christ and Baryshnikov expect (1.1 1 to be the 
most numerically stable of all equivalent formulae. 

2. Numerical Analysis 

In this paper, we study ideal discrete sensor fields, which are made up of a finite number of 
sensors, all of which are assumed to be performing their function without error. Here, the object of 
our study is discrete sensor fields which are made up of collections of square sensors of area 1x1 
which detect perfectly the number of targets above them. These sensors can also be thought of as 
point particles with square sensing regions. The sensors are then arranged in regular grids with no 
overlap or space among them. 

We define the height function h{x) for the sensor field such that for x not along the boundary 
of any sensor support, h{x) returns the total number of targets above that sensor whose sensing 
region contains x. In our construction of an ideal discrete sensor field, there is no sensing region 
along the boundaries of the sensor supports, and any target without width that lies entirely along 
the boundaries of sensor supports will not be picked up by our sensor field. Then for x along the 
boundary of a sensor support, we define h{x) such that its value agrees with the value returned by 
the sensor underneath or to the left of x. 

This discretization of the sensor field changes the basic behavior of the Euler characteristic 
integral in two important ways, which we now note below. 

2.1. Finite field discretization errors. When using Theorem |1.1| to evaluate the Euler char- 
acteristic integral over a bounded (and connected) region fl, we need to be careful. Suppose, for 
instance, the height function ft, = 10 throughout il, so that h never vanishes along dil. Then the 
calculated value of the integral using Theorem [TT] is 20. This differs fr om the correct value of 10 



by a factor of 2 (see figure 3a). To correct this error, we can zero-pad h by constructing a function 
g such that. 



h{x) if a; e 
iix^n 



and then integrating g over instead of h over ft. For the rest of this paper, this correction is 
always assumed to be made when working with finite, discrete sensor fields. 
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(a) With no zero- 
padding, J^hdx = 
20 




(B) 
zero 



With 
padding, 
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Figure 3. The Euler characteristic integrals of the returns of these two sensor 
fields show the effect on the integral caused by zero-padding the sensor field data. 



2.2. Linearity of the discrete Euler characteristic integral. Under discrete sensor fields, 
linearity of the Euler characteristic integral (which is true for continuous sensor fields) does not 
hold. Take for instance two height functions {hi and /i2) each generated by sensor fields of four 
sensors arranged in a square grid, where one small target is placed on each field. Then, with black 
indicating function value of 1 and white indicating 0, 






Note that hidx — l and /^^ /12 dx = f , but ^^{hi + (ix=l7^1 + l = 2. Thus, over discrete 
sensor fields the Euler characteristic integral is not a linear operator. 

However, one particular instance of linearity still holds for discrete sensor fields. 

Lemma 2.1. For any positive integer n, 

(1) / f^hdx^n f hdx 

Jn j^j^ Jvt 

Proof. This is a result known [5] to hold over continuous sensor fields, but as seen above that does 
not necessarily mean the result holds over discrete sensor fields. Thus we argue as follows to show 
the result continues to hold over discrete sensor fields. 
Note, 



(2) 
(3) 



f3o{nh > nk + i} = l3o{h > k} 
Po{nh <nk + i} ^ Po{h < k} 
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for all i = 0, 1, 2, . . . , n — 1 and for all fc = 0, 1, 2, 3 . . . . Then, 

oo 

'^{Mnh > s}- Poinh <s} + l) 

s=0 

n—1 oo 

'^{/3o{nh > nk + i} - P^inh < nfc + i} + 1) 

i=0 fc=0 
n — 1 oo 

oo 

nY^iMh > k} - Po{h < k} + 1) 

n hdx 
Jn 

□ 

2.3. Analysis of target enumeration. The numerical properties of Theorem as a point 
estimator for target enumeration in discrete sensor fields, depend entirely on the magnitude, shape, 
and distribution of the targets, as well as the setup of the sensor field. For example, the smaller 
the targets, the less likely a tangency is to occur for a given sensor field size. We first consider tiny 
targets, smaller than the individual sensor size. Given a large sensor field, the probability that two 
of these tiny targets will appear tangent and cause an error is much smaller than when large disks 
are being placed on a sensor field of the same size, as each large disk has a large perimeter to which 
another target can come in contact. Noncircular objects have their own sets of problems. Imagine, 
for instance, a star-shaped target and a disk-shaped target of similar magnitude: the disk has far 
less perimeter, and cannot interact with other disks in the way a star could interact with other 
stars. Moreover, the more uniformly spread out our targets, the less likely an error is to occur, in 
contrast to all the targets being concentrated in a single part of our sensor region fl. So, for our 
analysis we will impose a few constraints: 

(1) There are n targets, each of which are disks of the same integer radius r < 7. 

(2) Targets are discretely placed with integer center coordinates. 

(3) The equation for the discrete disks which will approximate our targets is + < and 
not -\- < as the latter formulation introduces extraneous features into the discrete 
disks (see figure [4]). 

(4) Sensor field region is rectangular, I x w, and the sensor field is made up of a regular grid 
of I ■ w sensors, each having a size of 1 x 1. The union of the sensors gives us the complete 
sensor field. 

(5) Targets' centers are uniformly distributed over our region. 

(6) To prevent edge effects, targets' centers are kept a minimum of r units away from edge. 

For a detailed analysis of discrete integer radius circles with integer centers (known as Freeman 
digitizations) see [T5] . 

We proceed with calculating the expectation of J^2 h dx when the integral is calculated using 
Theorem 1 and the assumptions are as above. First, note 



and then by ([2| and ([s]), 



which proves the statement. 
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(A) x2 + y2 < q2 (b) ^2^y2 ^ g2 

Figure 4. A comparison of two different formulae for discrete disks. 



(4) 



E ( / hdx] = n — E(number of errors) 



where errors are the number of miscounted targets, the deviation from the true value of n targets. 

2.4. First order approximation. A first order approximation would be to count the number of 
tangencies amongst the targets, and take this as an estimate of the error. Let's do that. 

E(# of errors) ss E(# of tangencies) 

= E tangencies on disk i^ 

n 

= E (# tangencies on disk i) 

i=l 

n 

= ^^(ri — 1) • Pr(one of the other n-1 disks comes tangent to disk i) 

i=l 

= n • (n — 1) • Pr(one of the other n-1 disks comes tangent to disk i) 

Then ignoring edge effects for a moment, 
(5) = n • (n - 1) • ^ 



{x-2r){y-2r) 

This is by linearity of expectation, the disks' centers being independently and identically distributed 

uniform random variables, and the fact that only an area of (x — 2r){y — 2r) is available for the 
centers to be placed. Here b = 88, for example, in the case of radius = 6 disks. There are 88 points 
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around one disk where the center of another identical disk can be such as to cause a tangency - as 
shown by the white ring around the radius = 6 discrete disk in figure [5] 

It is important to note that due to edge effects, even as an expected value of the number of 
tangencies and not errors, the above formula is an approximation. The average radius = 6 disk 
picked up by our sensor field doesn't have quite 88 spots around it where another disk can be placed 
to cause tangency; disks closer to the edge cannot have another disk placed to one side of them 
(see figure [6]) . So ([s]) becomes a better and better approximation to the number of tangencies as 
x^y oo. The magnitude of this error is most notable when placing radius r disks down on a 
2r X 2r sensor field without boundary effects: no matter now many disks are put down, as there is 
only one location where the target can fall, no tangencies will occur. 

We can correct for edge effects thusly. Assume our region is larger than 4r x 4r. Put a disk 
at each of the 2 • radius points closest to the boundary, and for each disk calculate at how many 
points in our region another disk could be placed to cause a tangency. Over the 2 • radius boundary 
locations, calculate the average number of tangency points for a disk centered at each location. 
Then we can calculate a weighted average representing the number of tangency points the average 
disk in our region is expected to have, which is a corrected value for our number of tangency points 
and is lower than the value calculated neglecting edge effects. 

Let a = average number of tangency points in boundary region and /3 = number of tangency 
points of disk neglecting edge effects such that, in a region il of dimension / x w, 

corrected (average) tangency points for a discrete disk (of radius r) in CI — 
{I ■ w - {w ~ 4- r) ■ (l - A ■ r)) ■ a + {w - 4 ■ r) ■ {I - A - r) ■ 13 
total area of Ct 

Experimental results (table [2]) suggest equation ([s]) holds as an approximation to the number of 
errors as long as targets (disks in this case) are not too densely packed allowing for many higher- 
order errors to occur; but this approximation still becomes poor very fast. 

So we need to find the next set of second order errors. 

2.5. Second order approximation. 

Theorem 2.2 (Second order error approximation formula). For simplicity assuming a radius of 6, 
letting SScorrected denote the number of tangency points of a radius 6 disk corrected for edge effects 




Figure 5. The white ring contains 88 points where a disk of radius 6's center can 
be placed, such as to cause a tangency with the center disk of radius 6. 
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and A — (x - 
(6) 



2r){y 



2r), a second-order estimate of n — (Jj^a hdx) is given by 

^Sco7'rected / 



A-A- 



where c is a proportionality constant and when the conditions assumed in the beginning of section 
[3 are met. 

As a first step towards proving let's calculate the Euler characteristic integral of a few 
different sensor fields to get a feel for how errors accumulate. See figure [7] 

In |7a[ there is one tangency and one error. In |7b[ however, there are two tangencies but still 
only one error. The third disk put down over the second discriminates itself by creating a new 
2-segment in the level sets (i.e. there is now an additional connected component in the {h = 2} 
level set). This is the key insight in our derivation of (|6|: going from 7a 



to 



7b the total number 



of errors does not increase. Appropriate generalizations also apply. We can shift the 3rd disk (see 
7c), and as long as it still discriminates itself against the 2nd disk the 3rd disk's tangency with the 
first disk does not lead to an error. Note similar phenomena in [7d] and [7e] 
On our way to proving ([6|, the above first suggests the following lemma. 

Lemma 2.3. Given a discrete sensor field with n disk-shaped targets of radius 6 discretely and 
uniformly placed over it as assumed in the beginning of section^ 

■ En-1 



(7) 



E„(# of errors) w E„_i + 



Border region - no centers can be placed here 




Figure 6. Any radius — 6 disk less than 2 • radius from the edge of the region 
has less than 88 points around it which can cause a tangency if another disk is 
centered at one of them. This on average reduces the constant b in the tangency 
calculation formula. 
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(a) 2 targets, f^hdx = I (b) 3 targets, f^hdx = 2 (c) 3 targets, f^hdx = 2 





+1 error O 
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+1 error 














+ 1 error 



(d) 4 targets, J^hdx = 3 (e) 5 targets, /i = 3 

Figure 7. Above are the height functions returned by 5 different sensor fields, 
and their Euler characteristic integrals. 



Proof. Put down the first target, 

Ei(# of errors) = 
Put down the 2nd target, and there is one target it can come tangent to, 

1^2 (# ot errors) = ^ 

Put down the 3rd target and there are two targets it can come tangent to in such a way as to create 
an error, but some some positions will also cause the 3rd target to be discriminated. The number 
of such positions is related to the number of tangencies already present, the expected number of 
which we already calculated above. So, 

SScorrected , 2 • SScorrected (previous number of tangencies) 
E3(# of errors) w j + j c j , 

by the uniform distribution of target centers, and where c is a proportionality constant. We are 
subtracting off from the probability that the 3rd target creates a tangency, the probability that the 
3rd target does not create an error, for as noted going from [Ta] to [7b] the number of errors does not 
increase. 

/ 1 1 r \ ^^corrected , 2 ■ SScorrected ( '^^corrected 

E3(# ot errors) ~ 



A A \ A^ 

IScorrected ( '^^corrected 



A \ 

For radius = 6, and given two tangent disks, there are on average (for the 88 possible setups. 



with a "center disk" held fixed) 21.455 points where a case such as either [7b] or 7c can occur, so we 
take c = 21.455 for second-order errors (see figure[8]). The value 21.455 is an exact value calculated 
by an exhaustive search of all possible points for a third disk, averaged over all 88 possible setups. 
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of 7a 7b 



Next, put down the 4th target. Note that in our second-order-crror world, E3 equals the number 

^ ^ ^ ^ 7c type situations that are expected to occur. We take each of these situations 

represented by E3 to have on average c = 21.455 points around them where another disk can be 
centered such as to not cause an error, while still causing a tangency (see figure [S]). So, as part of 
calculating the expected number of errors, when calculating the number of tangencies adding a 4th 
target is expected to create, we subtract off the probability of one of these tangencies being created 
by a target being centered at one of the c • E3 "good points." So, 



E4(# of errors) w E3 H — \ ^ 

6 ■ SScorrected 4c • SScorrected ^ ^^corrected 

J I2 + J3 



What we are studying here is the case where a "center" disk is held in a fixed position, has a 
height function of 1, and there is a 2nd disk tangent to this center disk. A 3rd disk tangent to the 
center disk can be sufficiently near the first tangency such as to not cause an error - so that this 
one grouping of three disks results in only one error. A fourth disk can be brought in tangent to 
the center disk, but still discriminate itself against the second or third disk. This one grouping still 
only contains one error. 

There can be two tangencies to the first center disk which don't interact, and this creates two 
separate groupings, each accounting for one error (see figure 7e|. 

In summary, note how each tangency creates a hotspot around which tangencies can cluster with 
only the first one counting as an error. These spots are what we are counting. E„ is a count of 
these groupings, as each grouping equals one error, so when we want to predict how many "good" 
spots there will be for the (n + l)th target to land, we know it will be approximately proportional 












^^^^^^^^ 













Figure 8. Around this cluster of targets, in our second order approximation with 
the left "center" disk held fixed with h = 1, there are on average (for the 88 
possible positions of the second disk) c = 21.455 positions where a third disk could 
be centered such as to be tangent with the center disk, but not cause an error by 
being discriminated by the second disk. The dots represent a few of these "good 
points." 
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to E„. Continuing in this way, 

/ // r \ , ^ ' ^^corrected ( E4 

E5(# of errors) « E4 + c( — 

10- 

^^corrected 

lOc- 

SScorrected , 5c SScorrected C SScorrected 
A ^2 + ^3 Ji 

Finally, 

E„(# of errors) « E„_i + ^ '— — 

which is □ 

Continuing on with our proof of note that Q is a first-order linear inhomogeneous recurrence 
relation with variable coefficient (s), a class of recurrence relations which has an elegant solution 
[14j . given by the following lemma. 

Lemma 2.4 ([14 ). The solution to a general first- order linear inhomogeneous recurrence relation 
with variable coefficient(s) in the form a„+i — fnOn + ffn , fn^O is given by 

(8) a„ = fn A (^0 + E yfi^] 



Proof. Starting with a„+i = /„a„ + g„ , /„ 7^ 0, 



1n+l — fnO"a + ffn 
^n+l fn^n — 



Cln+1 /nin 9n 



Let 



then 



- -prn-l . ' 



n— 1 n— 1 

3>i 



E (A,„+i - A,„) = A„ - Ao = E 



m=0 rti—Q 

_ 971 

n 

7n=0 ■ 

/ n-1 

5>7 



nn~l f ' 77™ f 



v/c=0 / \ m=0 



rife^o A- 



which is (|8|, the result we desired. 



□ 
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We are now ready to prove 

Proof. In (HI take a„ = E„ = E„_i (l - ^) + , substitute n + lin this relation 

to get a„+i = E„+i = E„ (l — ^) + "^^""J^''"*'"' , and then we compare this equation with a„+i = 
fnCin + Qn , and take /„ = (l — ^) and 5„ = FinaUy, noting that = as ao = (if no 

targets are put down, there are no errors), we plug what we know into ([s]) to get. 



„-2 / 1 \ "1-1 



^\ '-^^ corrected / -, \ \ ^ 

m=l ^ ^ 

Which is in the form of X]fc=i kr^~^ , a solution to which can be found by differentiating the standard 
geometric series formula with respect to r, 

a(l-r"+i) 



(11) E 



^ ' dr ^ (l-r)2 1-r 



fc=0 fe=l 



Applying ( 12 ) to ( 10 1 gives, after some algebra. 



^ »«co.rec«erf _ " . ^ _ ^ ^ „^ 

which is a relatively simple solution to our recurrence relation ([7|, and is the desired result 
(§. □ 

2.6. Immediate applications. If we take the limit of (n — ^^^"^a"*^'^ ((l — -|-)" ■ A — A + ncj) as 

n — > oo, 

(13) 



lim 

n—i-oo 



n — 



-'corrected 



where < c < A, we note that {n — ^^""g*"'*'"' ((l — ^) ■ A — A + ncj) diverges if c > 88corrected 
and it can become negative if c < S8corrected- For this reason we try setting c — SScorrected- 

We then note that equation ^ with c — 88corrected takes into account higher order errors and 
holds up very well, even when we "flood the gates" so to speak by having many targets very densely 
packed together. Moreover, equation ^ with the this same value of c continues to hold when targets 
are sparsely distributed, indicating that ^ is robust (see table [2] for numerical results for large and 
small n). 

In order to derive an improved point estimator for n, we note that for a fixed target radius and a 
fixed sensor field area, and with 88corrected < c < A, the function defined by taking n to the value 
given by 



(14) n ^((1 



c 

is injective. 



■ A- A + nc\ 
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Arguing by contradiction, we assume there are ni and n2, ni ^ n2, and ni > 712, such that, 
then 

{ni -na) / - = f 1 - -V - f 1 - - V 

^ \S>8corrected J }• A J V ^ 

>0 by assumptions <0 assumptions 

which under our assumptions {SScorrected < c < A and rii > 112) is a contradiction. Therefore, 



ni = 71-2 and the function which takes n to the value given in (14) is injective. 
Thus, given an observed value riobs of hdx, we can solve 

nobs = n-^^2^^^(^(l-^)".A-^ + , 

numerically for n, and obtain a better estimate for n than simply Uobs- As ^ is approximately 
unbiased, this method of obtaining an improved estimate of n is also approximately unbiased. 

Looking to expand our assumptions, an interesting case is when the sensor field is not only 
discretized, but the targets' centers are also continuously placed on this sensor field - a much more 
useful application than the discrete placement case. Imagine, for example, a discrete sensor field in 
front of you. You drop a pile of quarters onto it. Almost surely, not a single quarter will land with 
its center exactly at a point where four sensors meet; each quarter has the opportunity to drift in 
one direction or another. The slightest perturbation in a quarter's center coordinate away from an 
integer will cause it to land on sensors such as to distort its perceived shape and cause new and 
unforeseen errors (see figure [9]). Research needs to be done into this continuous placement case. 

Looking to further expand past our assumptions, the case when r > 7 is most interesting. 
With r > 7 disks placed discretely on a discrete sensor field, more than simple tangencies can 
occur. Discretized disks of larger radii can exhibit concave behavior when two identical disks join 
in certain precise ways to create a single element in the level sets with an Euler characteristic of 
zero. Larger disks can also exhibit another type of "locking behavior" where two disks join together 



to have an integral with respect to the Euler characteristic of 3 (see figure 10). The number of 
positions around a disk that another disk's center can be placed to cause these different effects 
varies from radius to radius, and no pattern seems to emerge (see table [l] and figure 11 ). 



An analysis similar to that done for radii less than 8 should be applicable to greater radii. 



Figure 9. This is the level set generated from two radius = 6 disks falling onto 
a discrete sensor field with continuous placement of their centers. New behavior 
arises in this case. Here J^hdx = 0, which does not occur with discrete placement. 
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Figure 10. Above are examples of the ways targets of radii > 7 can join together 
to create different types of errors when evaluating h dx over a discrete sensor 
field. 

See table [2] and figure [12] for numerical examples of our results. Observed data were generated 
from a virtual sensor field environment created in GNU Octave [5] . 

3. Asymptotic Behavior 

Given an understanding of how errors in target enumeration accumulate for small n, the question 
then arises, how does Theorem |1.1| behave for large n? In answering this question, a connection 
arises to a problem in probability called the coupon collector's problem. Given a set of p coupons 
of all different types, how many coupons does the coupon collector expect to need to draw with 
replacement before having at least one copy of each type of coupon? A similar question is, "How 
many coupons does the coupon collector expect to need to draw with replacement before having at 
least q copies of each type of coupon?" A solution to this more difficult multiple coupon problem is 
given by |16| . The link between this multiple coupon problem and Euler characteristic integrals is 
seen when we note that if for some integer s the {h = s\ level set covers all of f2, then by Theorem 
|1.1| this level set contributes s units to the target enumeration count. If, for instance, each sensor 
in a sensor field picks up 10 targets, then the {h = 10} level set will cover all of 17, and Theorem 
|1.1| will return a target count of 10. In [16] it is shown that for large g, the expected number of 
coupons the coupon collector needs is asymptotic to pq by the law of large numbers. Thus, for a 
regular grid of sensors that is I sensors long and w sensors wide, and for targets that are the size of 
the sensors and discretely placed over the centers of sensors, this is saying that for large n, every 
Iw targets placed over the sensor field increases by one the number of the highest level set that 
covers all of f2.. Then, if you put down mlw targets for some large positive integer m, you would 
expect the {h — m} level set to cover fi, which would account for mlw targets, leaving a total of 
mlw — mlw — targets to fill the higher level sets. In other words, our examination shows that the 
highest non-empty level set of h covers fi, and all higher level sets are empty and do not contribute 
to the Euler characteristic integral. 

To work more generally with any setup of sensor field (assumed to be in a region f2), and any 
(fixed) target shape, with targets placed uniformly and discretely, we proceed as follows. Construct 
a height function h' which is the return of a sensor field in which each sensor has a target centered 
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radius 


Error Type 




Error Type 





1 


3 


radius 





1 


3 


1 





8 





51 


168 


584 


80 


2 





24 





52 


152 


600 


88 


3 





40 





53 


112 


584 


96 


4 





56 





54 


136 


576 


88 


5 





72 





55 


120 


632 


80 


6 





88 





56 


200 


600 


96 


7 





104 





57 


176 


632 


112 


8 


8 


112 


8 


58 


248 


632 


112 


9 





136 





59 


176 


632 


88 


10 


16 


136 


16 


60 


200 


696 


112 


11 


8 


160 


8 


61 


184 


624 


120 


12 


8 


176 


8 


62 


176 


680 


112 


13 


16 


176 


16 


63 


192 


672 


104 


14 


8 


208 


8 


64 


248 


704 


112 


15 


24 


200 


24 


65 


216 


744 


88 


16 


16 


224 


16 


66 


192 


688 


96 


17 


32 


208 


32 


67 


184 


720 


88 


18 


40 


248 


32 


68 


280 


728 


152 


19 


16 


272 


16 


69 


224 


712 


136 


20 


32 


256 


32 


70 


208 


752 


128 


21 


16 


288 


16 


71 


200 


704 


112 


22 


56 


280 


48 


72 


184 


736 


112 


23 


40 


312 


32 


73 


280 


768 


112 


24 


32 


320 


32 


74 


184 


768 


104 


25 


48 


304 


48 


75 


328 


752 


152 


26 


56 


344 


48 


76 


208 


752 


128 


27 


72 


336 


56 


77 


232 


824 


120 


28 


48 


352 


48 


78 


288 


792 


128 


29 


32 


376 


32 


79 


216 


792 


112 


30 


72 


392 


48 


80 


264 


768 


128 


31 


48 


408 


40 


81 


304 


856 


128 


32 


88 


416 


48 


82 


248 


832 


152 


33 


48 


400 


56 


83 


320 


840 


152 


34 


72 


408 


48 


84 


240 


848 


128 


35 


88 


424 


64 


85 


376 


888 


136 


36 


56 


440 


56 


86 


312 


864 


152 


37 


96 


440 


56 


87 


352 


832 


168 


38 


72 


472 


48 


88 


264 


864 


136 


39 


136 


456 


80 


89 


224 


856 


152 


40 


104 


488 


64 


90 


336 


888 


160 


41 


80 


488 


72 


91 


336 


920 


176 


42 


96 


504 


48 


92 


320 


872 


160 


43 


56 


536 


56 


93 


304 


944 


120 


44 


144 


512 


80 


94 


376 


952 


152 


45 


104 


544 


64 


95 


368 


912 


184 


46 


152 


496 


88 


96 


352 


960 


152 


47 


152 


552 


88 


97 


344 


904 


168 


48 


96 


536 


80 


98 


400 


1008 


128 


49 


136 


520 


104 


99 


448 


912 


200 


50 


112 


632 


72 


100 


408 


952 


184 



Table 1. For varying radius and discrete placement with integer centers, the 
above table gives the number of positions around a discrete disk where, when 
another identical disk is placed there, J^2 hdx =0, 1, or 3 (which we call type 0, 
1, or 3 errors). 
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type errors O 
type 1 errors — B 
type 3 errors — ^ 




radius 



Figure 11. A plot of table 1 



over it. Let, 

(15) f h'dx 

Let a be the number of sensors in our sensor field. Then when n targets are uniformly and 
discretely put down, where n = m ■ a for some positive integer m, and where h is the height 
function coming off this sensor field, by ([T]), ^15^ and the above reasoning, we have 

Theorem 3.1. 



(16) 



f hdx — Hra, as n —> oo 
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n 


observed 


Formula ([5j) , 

^ — 88correcied 


Formula (|6| , 
c = 21.455 


Formula ([6|) , 

^ — 88correcietZ 


100 


98.294 


96.5 




98.2 


98.2 


200 


193.12 


185.7 




192.9 


193 


300 


284.72 


267.9 




284.1 


284.5 


400 


373 


342.8 




371.7 


372.7 


500 


458.5 


410.6 




456 


457.8 


600 


541.04 


471.2 




536.8 


540 


700 


619.55 


524.7 




614.2 


619.2 


800 


697.11 


571.0 




688.2 


695.7 


900 


771.23 


610.1 




758.9 


769.4 


1000 


843.67 


642.1 




826.3 


840.6 


2 000 


1 446.2 


567.6 




1 324.9 


1428 


3 000 


1 881.5 


-223.4 




1 524.1 


1 838.5 


4 000 


2 178.9 


-1731.0 




1 449.7 


2 125.4 


5 000 


2 390.0 


-3 955.2 




1 125.2 


2 325.9 


6 000 


2 524.1 


-6 895.9 




572.1 


2 466 


7 000 


2 611.1 


-10 553.2 




-189.7 


2 563.9 


8 000 


2 663.2 


-14 927 




-1 142.4 


2 632.4 


9 000 


2 689.8 


-20 017.4 




-2 269.4 


2 680.2 


10 000 


2 703.2 


-25 824.3 




-3 555.9 


2 713.6 



Table 2. For a 500 x 500 discrete sensor field with n radius — 6 targets put down 
uniformly and discretely, the observed value (averaged over 1 000 trials) of /jjj h dx 
is presented, along with E (/g2 h dx) calculated using ^ and then the second-order 
(|6| with both the second-order value of c and with c = SScorrected- For this region, 

88correcied ~ 85.322. 



4. Conclusion 

The main result in this paper is (|6]), which gives an approximation to the bias of the Euler 
characteristic integral as a point estimator for n, the number of targets over a discrete sensor field. 
Numerical results show that equation ^ agrees closely with obersvations. 

But this is just a preliminary analysis, and there is still much to be done. There are also many 
constraining assumptions we made here which are ripe for relaxing. Areas of further research might 
include looking into, 

• The discrete Euler integral's complete probability distribution, or even just its variance. 

• A generalization of the numerical analysis theory to any distribution of targets. 

• A generalization to any target shape (and not just 2-d projections of discrete disks!). 

• Generalizations to multiple target shapes on the same field. 

• Non-regular sensor grids. 

• The error tolerance and stability of the discrete integral in the face of noisy data, broken 
sensors, or other real world problems. Future work should include a numerical analysis of 
the theoretical work done on the Euler characteristic integrals of Gaussian random fields 
[3], which permits, at least theoretically, unbiased target enumeration off noisy sensor fields. 
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• The techniques for discrete Euler integral analysis and signal processing suggested in [5] 
and [J. 

• Improving our target enumeration estimate by analysis of time series data for moving 
targets, similar to |18| . 

• Extending the work done here to targets that arc continuously placed onto a sensor field, 
as opposed to discretely placed with integer center coordinates. 

Possible applications of any numerical analysis will require additional work. All theory discussed 
in this article has been based on a priori knowledge of target shape and target distribution. This 
suggests possible areas of research into relaxing of these requirements in some way. We should try 
to determine estimates for these parameters from the data we are given - a sensor field. 
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